Serveur d'exploration sur la TEI

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

METAe—Automated Encoding of Digitized Texts

Identifieur interne : 000244 ( Main/Exploration ); précédent : 000243; suivant : 000245

METAe—Automated Encoding of Digitized Texts

Auteurs : Birgit Stehno [Autriche] ; Alexander Egger [Autriche] ; Gregor Retti [Autriche]

Source :

RBID : ISTEX:C82C92A176F34CD3AE19FA346A45C9E863DBF21C

Descripteurs français

English descriptors

Abstract

This paper explains why and how the digitization project METAe applies METS (Metadata Encoding and Transmission Standard) as encoding scheme for automatically extracted metadata. In contrast to TEI (Text Encoding Initiative) and other markup languages, METS allows encoding of the whole range of structural, descriptive, and administrative metadata in a systematic way. As the METS schema permits the integration of other existing standards, it provides a highly flexible output that can be converted easily to the individual needs of digital libraries. An innovative aspect of the METAe data structure is the ALTO file (‘Analysed layout and text object’), which contains the layout structures as well as the text passages of book pages. Structural maps of the METS schema are used to compose the logical and the physical structures out of ALTO and image files.

Url:
DOI: 10.1093/llc/18.1.77


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">METAe—Automated Encoding of Digitized Texts</title>
<author>
<name sortKey="Stehno, Birgit" sort="Stehno, Birgit" uniqKey="Stehno B" first="Birgit" last="Stehno">Birgit Stehno</name>
</author>
<author>
<name sortKey="Egger, Alexander" sort="Egger, Alexander" uniqKey="Egger A" first="Alexander" last="Egger">Alexander Egger</name>
</author>
<author>
<name sortKey="Retti, Gregor" sort="Retti, Gregor" uniqKey="Retti G" first="Gregor" last="Retti">Gregor Retti</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:C82C92A176F34CD3AE19FA346A45C9E863DBF21C</idno>
<date when="2003" year="2003">2003</date>
<idno type="doi">10.1093/llc/18.1.77</idno>
<idno type="url">https://api.istex.fr/document/C82C92A176F34CD3AE19FA346A45C9E863DBF21C/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000057</idno>
<idno type="wicri:Area/Istex/Curation">000057</idno>
<idno type="wicri:Area/Istex/Checkpoint">000201</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000201</idno>
<idno type="wicri:doubleKey">0268-1145:2003:Stehno B:metae:automated:encoding</idno>
<idno type="wicri:Area/Main/Merge">000265</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Pascal:04-0078653</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000043</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000010</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000035</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000035</idno>
<idno type="wicri:doubleKey">0268-1145:2003:Stehno B:metae:automated:encoding</idno>
<idno type="wicri:Area/Main/Merge">000280</idno>
<idno type="wicri:Area/Main/Curation">000244</idno>
<idno type="wicri:Area/Main/Exploration">000244</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">METAe—Automated Encoding of Digitized Texts</title>
<author>
<name sortKey="Stehno, Birgit" sort="Stehno, Birgit" uniqKey="Stehno B" first="Birgit" last="Stehno">Birgit Stehno</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Autriche</country>
<wicri:regionArea>University of Innsbruck, Innsbruck</wicri:regionArea>
<wicri:noRegion>Innsbruck</wicri:noRegion>
</affiliation>
<affiliation>
<wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Egger, Alexander" sort="Egger, Alexander" uniqKey="Egger A" first="Alexander" last="Egger">Alexander Egger</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Autriche</country>
<wicri:regionArea>University of Graz, Graz</wicri:regionArea>
<wicri:noRegion>Graz</wicri:noRegion>
</affiliation>
<affiliation>
<wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Retti, Gregor" sort="Retti, Gregor" uniqKey="Retti G" first="Gregor" last="Retti">Gregor Retti</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Autriche</country>
<wicri:regionArea>University of Innsbruck, Innsbruck</wicri:regionArea>
<wicri:noRegion>Innsbruck</wicri:noRegion>
</affiliation>
<affiliation>
<wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Literary and Linguistic Computing</title>
<title level="j" type="abbrev">Lit Linguist Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2003-04">2003-04</date>
<biblScope unit="volume">18</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="77">77</biblScope>
<biblScope unit="page" to="88">88</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">C82C92A176F34CD3AE19FA346A45C9E863DBF21C</idno>
<idno type="DOI">10.1093/llc/18.1.77</idno>
<idno type="local">180077</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Automation</term>
<term>Coding</term>
<term>Digitizing</term>
<term>Electronic library</term>
<term>Metadata</term>
<term>Standardization</term>
<term>Standards</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Automatisation</term>
<term>Bibliothèque électronique</term>
<term>Codage</term>
<term>Métadonnée</term>
<term>Normalisation</term>
<term>Norme</term>
<term>Numérisation</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Automatisation</term>
<term>Codage</term>
<term>Normalisation</term>
<term>Norme</term>
<term>Numérisation</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper explains why and how the digitization project METAe applies METS (Metadata Encoding and Transmission Standard) as encoding scheme for automatically extracted metadata. In contrast to TEI (Text Encoding Initiative) and other markup languages, METS allows encoding of the whole range of structural, descriptive, and administrative metadata in a systematic way. As the METS schema permits the integration of other existing standards, it provides a highly flexible output that can be converted easily to the individual needs of digital libraries. An innovative aspect of the METAe data structure is the ALTO file (‘Analysed layout and text object’), which contains the layout structures as well as the text passages of book pages. Structural maps of the METS schema are used to compose the logical and the physical structures out of ALTO and image files.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Autriche</li>
</country>
</list>
<tree>
<country name="Autriche">
<noRegion>
<name sortKey="Stehno, Birgit" sort="Stehno, Birgit" uniqKey="Stehno B" first="Birgit" last="Stehno">Birgit Stehno</name>
</noRegion>
<name sortKey="Egger, Alexander" sort="Egger, Alexander" uniqKey="Egger A" first="Alexander" last="Egger">Alexander Egger</name>
<name sortKey="Retti, Gregor" sort="Retti, Gregor" uniqKey="Retti G" first="Gregor" last="Retti">Gregor Retti</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000244 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000244 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:C82C92A176F34CD3AE19FA346A45C9E863DBF21C
   |texte=   METAe—Automated Encoding of Digitized Texts
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024